The IBM Submission to the 2008 Text-to-Speech Blizzard Challenge
Authors
Abstract
The 2008 Blizzard speech synthesis challenge gave participants an opportunity to evaluate their systems in UK English and Mandarin. This paper describes the work behind three IBM systems submitted to the challenge for these two languages. The systems are concatenative unit-selection text-to-speech synthesis systems that share a core algorithmic base, with algorithmic variants introduced not only to address the language-specific component of the synthesis engine (i.e., the text-processing front-end) but also to better serve properties particular to each language type (e.g., the tonal nature of Mandarin). The resulting systems were evaluated on several tasks designed to assess overall naturalness, intelligibility, and the preservation of speaker identity. All the IBM systems submitted achieved very good performance in both languages across the tasks reported in this paper.
Related resources
The IBM Submission to the 2006 Blizzard Text-to-Speech Challenge
In this paper, we present two concatenative text-to-speech systems built from the “Blizzard Challenge” speech databases. The two systems differ primarily in their segment selection cost function. One system has our baseline cost function, and the other has a cost function which has been altered to potentially better handle small datasets. Results indicate that both systems perform similarly in ...
I2R's Submission to Blizzard Challenge 2008
This paper reports I2R's submission to the Blizzard Challenge 2008. This is our first participation in the Blizzard Challenge. In this paper, we describe the approach that we used to build the three required voices. We introduced acoustic parameters that include MFCC coefficients as spectral parameters, in addition to the prosodic parameters, for unit-selection-based speech synthesis. We used ...
Text-to-Speech System for Blizzard Challenge 2011
This paper describes I2R's submission to the Blizzard Challenge 2011 speech synthesis evaluation. This is our fourth participation in the challenge. In this paper, we will describe our main approaches to building the required voices. We will describe our definitions of the acoustic, prosodic and linguistic parameters, the procedure of candidate unit selection, the components of the cost functions, etc. Fina...
Speech Database Speech Analysis Training of MSD-HSMM Excitation parameters Spectral parameters Speech signal Context-dependent MSD-HSMMs and duration models Speech Parameter Generation
This paper describes the text-to-speech synthesis system developed for the Blizzard Challenge 2016 by members of the ADAPT centre and colleagues from associated projects. The task was to build a synthetic voice for reading audiobooks to children, from a speech database of audiobooks around 5 hours long. Our entry system is an HMM-based parametric speech synthesizer which was built using a subse...
Text-to-Speech System for Blizzard Challenge 2010
This paper describes I2R's submission to the Blizzard Challenge 2010 speech synthesis evaluation. This is our third participation in the challenge. In this paper, we will describe our main approaches to building the required voices. We will introduce the procedure of database processing, the definitions of the acoustic, prosodic and linguistic parameters, the components of the cost functions, etc. F...